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Abstract 

Mathematical ideas and approaches common in complexity-related fields have been 
fruitfully applied in experimental high energy physics also. We briefly review some of 
the cross-pollination that is occurring. 

Traditionally, the world of High Energy Physics (HEP) has not been considered to 
be part of the loose conglomeration of topics collectively termed complexity. While the 
topics and communities are still largely disjoint, the degree and intensity of overlap has 
grown as HEP has expanded from the study of single particles (such as the top quark, 
the VP-boson and the H-meson) to precision measurements of complicated multiparticle 
systems. Commonality and mutual inspiration between HEP and complexity is to be 
sought not in the physics per se, but in the underlying mindset and approach. Specifically, 
the mathematics used in one may be transmuted and adapted to the other to good effect. 
Indeed, it is the mathematics and the mindset that unifies the diverse phenomena and 
applications making up complexity. 

This article seeks to highlight a few cases where such cross-pollination has been 
occurring. We confine ourselves specifically to experimental high energy physics in the 
knowledge that efforts to apply concepts and approaches of complexity to theoretical 
HEP would widen the scope considerably, cf. Pin- 

Events and their history: A typical HEP experiment consists of collecting a 
sample of many events. Each event occurs in three phases: in the first, two incoming 
projectiles, which can be leptons such as an electron and positron, hadrons such as protons 
or mesons, or highly ionised nuclei, are made to collide at very high energy. In Phase Two, 
the resulting ultrahigh concentration of energy is converted, according to incompletely 
understood laws, into many elementary particles which in turn collide with each other, 
transmute and decay into lower-energy particles. In the third phase, the final particles 
cease interacting and stream off into the detectors where they are recorded. Modern 
experiments typically accumulate millions of events. 

Conditions and parameters specific to HEP: For the purposes of comparing 
and contrasting the respective mathematics, the following fundamental properties of HEP 
experimental systems and data are of importance. 

(A) Recorded events are purely spatial in character (with the “phase space” Q defined 
e.g. in terms of momentum); there is no time ordering or time information. All 
mathematics relying on time ordering, such as time series analysis, is hence irrelevant 
from the start. Also, all time information of the dynamics occurring in Phase Two 
of the collision can be inferred only indirectly from the available spatial information. 

(B) Each recorded event is a point process, i.e. it consists of N structureless particles 
represented as points in the space. The multiplicity N fluctuates from event to 

^in: Second International Conference on Frontier Science: A Nonlinear World: The Real World, Pavia, 
Italy, 8-12 September 2003. Physica A338, 20-27 (2004). 


1 



event. Not all particles are picked up by the detectors; considerable effort goes into 
correcting for this. 

(C) There are many different types of particles. In general, each particle is identified in 
terms of charge, mass, etc., and its momentum is measured. 

(D) Complex systems measurements and simulations commonly deal with 10® data points. 
By contrast, each HEP event consists of very few particles, ranging from a handful 
(~ 5 for restricted measuring intervals or lower energies) to a maximum of in 
nucleus-nucleus collisions. 

(E) While each event consists of comparatively few particles, huge samples containing 
up to 10^ such events are now available. The net amount of data available in mod¬ 
ern HEP experiments exceeds complexity data from measurement or simulation by 
orders of magnitude. 

(F) The distribution of particles in the measuring space can be highly nonuniform in 
space. Typical phenomena include the formation of strongly clustered “jets” in 
lepton-lepton and hadron-hadron collisions, kinematic effects, collective flow, and 
conservation laws. 

(G) There can be differences from event to event even on a fundamental level due to un¬ 
controllable or unknown parameters such as the amount of overlap between colliding 
projectiles. 

In summary, HEP experimental data can be characterised mathematically as a large sample 
of sparse multispecies point processes. 

In both HEP and experimental complexity, the aim of characterisation quantities 
and techniques is to eliminate candidate theoretical models purporting to explain the 
results. What, then, can the HEP experimentalist learn from complexity? Two common 
dominants have emerged: the concept of scale and the use of multivariate statistics. While 
both topics have been long familiar in complexity, the properties listed above result in 
HEP-specific limitations, adaptations and opportunities, which we now examine. 

Given its point process character, the natural complexity counterpart of an HEP 
event is a strange attractor, where points x(t) of the dynamical map are plotted in the 
embedding space while discarding time information. While initially the focus of character¬ 
isation had been on purely geometric properties of attractors, the advent of the correlation 
integral [S] and multifractals |^, permitted a more complete description in terms of both 
support and measure [S]. Box multifractals, for example, are defined in terms of moments 
Pq = UmPm (q = 1)2,...) of relative frequencies pm = nm/N (with Um the number of 
particles in bin m, and N = If Pq turns out to exhibit power-law behaviour as 

a function of scale (bin size) i, Pq{i) ~ then the set of constants Dq are termed 

“generalised dimensions” or Renyi dimensions [3 [ 3 . 

The predilection in complexity towards scale invariance proved a fruitful inspiration 
to HEP experiments: scaling of suitably normalised moments was, indeed, found in many 
cases; see [Oj for reviews. The strong anisotropy in behavior parallel and perpendicular to 
the collision axis has also induced measurements of Hurst exponents [ 7 ]. Due to Property 
(D), however, a prejudice towards scaling is not helpful. Multifractals arise most natu¬ 
rally in infinite-generation multiplicative cascades [SI, and so jets, the hierarchical cascade 
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structures of particle formation in lepton-lepton collisions, would be the best candidates 
for scaling. Typical jet multiplicities are rather low, however, so that the scaling interval 
will necessarily be small. Nucleus-nucleus collisions, on the other hand, produce large 
numbers of particles, but these originate not from cascades but from a semi-thermalised 
second phase which effectively destroys information on particle histories and hence is not 
conducive to scaling. 

Factorials: To deal with the low-multiplicity problem. Bialas and Peschanski [Hj, 
noting that the dynamical variable relevant to HEP cascades was usually continuous while 
the measured particles were necessarily discrete, postulated that the transition from con¬ 
tinuous dynamics to a discrete number of particles would be a poissonian fluctuation. 
Based on this assumption, they showed that the factorial moments of n discrete par¬ 
ticles, (n(n—1) • • • (n—g'-l-l)), correspond the ordinary moments (x^) of their continuous 
ancestor x. Lipa m showed early on that an explicitly scaling continuous model with a 
continuous-to-discrete last step does continue to exhibit scaling even for low multiplicities. 

Multivariate statistics: While scaling as such was an important addition to the 
HEP vocabulary, its narrow applicability necessitated a wider approach; scaling assump¬ 
tions are permitted in multivariate statistics, but not required. While the HEP commu¬ 
nity had long made use of the latter to characterise e.g. two-particle correlations HU, 
the advent of complexity-inspired thinking has boosted its significance and sophistication 
considerably. 

The sample of eventwise measures pe, e = 1,..., A4v serves as the starting point. 
There is a direct analogy^ between the invariant measure of the strange attractor, made 
up of points at the i-th iterate of map /(x), and the measure of an event e made up of N 
particles measured at phase space points 

, N N 

N Pe(x) (1) 

i=l i=l 

Phase space integrals: Observation coordinate x and the data Xj live in an embedding 
space H. In the language of eventwise measures, multifractals and indeed all correlation 
statistics are easily seen to be integrals over subregions of products of H. An example of a 
one-dimensional case H = [0, L] is illustrated in Eig. 1 for q = 2. The original event measure 
Pe of a typical event is visualised as N dots living on the line [0, L], shown in Fig. 1 below the 
squares. A product of the measure with itself,^ /5 e(xi,X2 ) = '^(xi—XjJ (5(x2 —Xjj), 

is then a set of dots on the product space H represented by the squares in the Figure. 
Each dot represents a particle pair, and second moments for a particular domain are found 
by counting the number of dots in a given domain. 

Testing dependence on scale in box moments then amounts to integrating over bin 
i = L/2^,k = 1,2,..., counting only pairs falling in the string of (L/2*^)^-sized squares 
along the diagonal (see Fig. l(a)-(c)). As illustrated in Fig. 1(d), the correlation integral 
[S] is nothing but a series of strips of varying width parallel to the main diagonal. 
Many other “slices of phase space” have been defined, such as autocorrelations, fixed-bin 
correlations and statistics based on void intervals in the densities. 

^There are obvious differences, the first being that for a finite number N of particles, a HEP event 
cannot accommodate the limit A^—> oo needed to define the invariant measure. Clearly, pe will also differ 
radically from event to event, so there is no question of invariance on this level. 

®The unequal sign, enforcing factorial counting, subtracts out points on the diagonals, i.e. those points 
where a single particle is counted as “a pair with itself”. 
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(a) (b) (c) (d) 

Figure 1; Representation of a single event in O (shown as dots on the lines below the 
square) and in ( 8 ) space (in the squares), (a)-(c): Domains of box moments as a 
function of scale, (d): corresponding correlation integral domains. Filling in the “missing 
dots” along the diagonals would change factorial to ordinary moments. 


While higher orders q>2 are not visualised as easily, their analysis proceeds analo¬ 
gously. Issues of “topology”, i.e. the combination of pairwise interparticle distances used 
to determine ( 7 -tuple size, come into play urn- 

Basing mathematical analysis on eventwise measures was instrumental in deriving 
correlation integral prescriptions for a HEP context m and in providing a mathematical 
basis for event mixing, whereby the uncorrelated background is simulated by analysing 
artificial events made up of particles selected randomly from different events in the sample. 

Cumulants: The availability of large samples of low-multiplicity events led to the 
direct measure of cumulants, Kq CB] which have proven to be central to later experimental 
efforts due to their statistical properties and sensitivity.^ Their general properties include: 

• Cumulants are zero when there is no net correlation. 

• Cumulants of the sum of independent random variables Xj are additive, 

9 = 1,2,3,... (2) 

i 

• The cumulant Kq{A — B) of distribution A relative to distribution B is equal to the 
difference of the A- and H-distribution cumulants in, 

Kq{A - B) = Kq{A) - Kq{B) . (3) 

B may, for example, be a reference distribution and Kq{A — B) the measured devia¬ 
tions from this pre-defined reference. 

• With the exception of ki, multivariate cumulants are tensors under affine transfor¬ 
mations; a simple example is Kg(cx) = c'^Kg(x). 

• Using event mixing, cumulants can be calculated even for correlation integrals m- 

“^Their lowest-order forms, (ti = {x) = sample mean and K2 = {x^) — {x)'^ = sample variance are, of 
course, universally known. 
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An example of the usefulness of cumulants is shown in Fig. 2: measured second order 
cumulants for hadron-hadron collisions were fitted using various parametrisations (only 
the power law does well), and the same parametrisation then compared to measured third- 
order cumulants m- The discrepancy shows that the underlying model assumptions fail 
on third order even while being successful for g=2. 




2 


1.5 


1 


0.5 


0 


{+++) and (—) 


^ UAl data 
A EWS power law 
0 EWS exponential 



q^(GeV)' 


Figure 2: Use of cumulants in HEP to test theoretical models. Data was fitted to q=2 
(left) and q=3 model predictions then compared to data (right). 

Central Limit Theorem: Capitalising on the many particles generated in nucleus- 
nucleus collisions, recent experiments have exploited the close link between the scale £ of a 
region and the number n of particles it can be expected to contain. Assume, simplistically, 
a constant number of particles N. Given the additivity property m, it is easy to show 
that the cumulant of the average, {x) = is equal to the cumulant over a 

smaller region containing but one particle, suppressed by powers of N, 

Kq ((x)) = N~‘l Kq{Xi) = Kq{x) , (4) 

i 

if the Xi are independent. Sensitive testing of independence is therefore provided by mea¬ 
suring the deviation from zero of the statistic 

N'^~^Kq{{x)) - Kq{x) . (5) 

while varying the scale interval over which (x) is calculated. This can be interpreted as 
an application of Eq. ©• 

A more sophisticated version of this has recently been used m to partially address 
fluctuations in total eventwise multiplicity also: With p(m) = Pi fsum of all Um 
transverse momenta pi of particles found in bin m of size i for one event e, and p the 
average over momenta of particles from all events (“inclusive average”) in the same bin, 
the total variance, defined as 

= T ’ (6) 
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turns out to be a difference between a q=2 cumulant over bins at scale t and a cumulant 
at the smallest available scale, thereby effectively making use of Eq. ©• Multiplicity 
fluctuations are suppressed in Eq. because „ is based on bin-averaged versions of 
the standardised variable x* = [x — ki{x)]/[k 2 {x)Y/‘^ which automatically subtracts out 
mean multiplicity ki. 

Fig. 3 shows schematically how can be expected to behave as a function of scale 
I = 5x |17| . Correlations manifest themselves as significant changes in ^ with a change 
in scale, while lack of correlation shows up as invariance under change of measuring scale, 
which is another formulation of the Central Limit Theorem (CLT). 



Figure 3: Changes with scale in correlation structure of “points” (shown under various 
magnifications in the little squares). Scale intervals of constant T? signify no correlation 

[Hi- 


Other applications of cumulant differences include calculating cumulants for a fixed- 
multiplicity sample using the multinomial as reference distribution B d, and the cal¬ 
culation of quantum mechanical interference effects between the jets resulting from the 
weakly interacting bosons W~^ and W~ [TKj . 

Event-by-event physics: The availability of large samples allows the study of 
very detailed special cases or events. Selecting subsamples or plotting relative frequencies 
of events as a function of some event property has become known as “Event-by-event 
analysis”. Some have attempted to characterise individual events, for example by wavelet 
transforms m, but so far, true event-by-event analysis has been rare. Nevertheless, the 
degrees of freedom provided by the sampling hierarchy m of particles, events, sampling 
distributions, special selections etc. are only just beginning to be appreciated and ex¬ 
ploited. 
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